Cooperative Cognitive Networks: 
Optimal, Distributed and Low- Complexity Algorithms 



<: 

<N 



Gan Zheng* S. H. Song| Kai-Kit Wongf and Bjorn Ottersten* 

(N 

S Abstract 
CN 

This paper considers the cooperation between a cognitive system and a primary system where multi- 
ple cognitive base stations (CBSs) relay the primary user's (PU) signals in exchange for more opportunity 
to transmit their own signals. The CBSs use amplify-and-forward (AF) relaying and coordinated beam- 
t— I forming to relay the primary signals and transmit their own signals. The objective is to minimize the 
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HH overall transmit power of the CBSs given the rate requirements of the PU and the cognitive users (CUs). 

O We show that the relaying matrices have unit rank and perform two functions: Matched filter receive 

^ beamforming and transmit beamforming. We then develop two efficient algorithms to find the optimal 

^ solution. The first one has linear convergence rate and is suitable for distributed implementation, while 
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the beamforming vectors for the linear conventional zero-forcing (CZF) and prior zero-forcing (PZF) 
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I. Introduction 



Communications over wireless channels continue to be major challenges of today's technologies mainly due 
to spectrum scarcity and channel fading characteristics. While spectrum utilization depends very much on 
place and time, it has been well known that most spectrum is heavily under-utilized [1]. Cognitive radio 
system (CRS) [2] is a new paradigm to improve the spectrum efficiency by allowing a secondary system 
to access the spectrum licensed to the primary system. In a typical setup, the primary users (PUs) have 
the priority to access the spectrum, while CRS can occupy the spectrum only if it does not interrupt the 
communication of the primary system. In practice, this either requires the CRS to sense and detect the 
spectrum holes and then access the spectrum opportunistically or demands the interference from CRS to 
the primary system to be properly controlled. In either case, the primary system and CRS are working 
separately. A major challenge therefore arises to guarantee the quality-of-service (QoS) of cognitive users 
(CUs) without degrading the primary system performance. In light of this, a number of beamforming 
techniques have been proposed to achieve various related objectives assuming perfect [3], partial [4] or 
imperfect channel state information (CSI) [5, 6] available at the CRS regarding the primary system. 

On the other hand, cooperative communication, especially via relaying, is a promising countermeasure 
for channel fading. Relaying strategies may be categorized into three main types: 1) compress-and-forward 
(CF) 2) amplify-and-forward (AF) and 3) decode-and-forward (DF). Among them, AF, in which the relay 
simply performs linear processing on the received noisy signal from the sender and forwards it to the 
destination, is arguably the most attractive strategy, due to its relatively low implementation complexity. 
Interested readers are referred to the special issues such as [7] for more details on this topic. 

Cooperation can be used to improve the reliability performance of CRS as well as the primary system. 
The advantages of cooperation are manifold. It can help the CRS forward the primary and cognitive signals 
[8], spectrum sensing [9, 10] and save energy in the process of cooperative sensing and transmission [11]. 
Without introducing additional relay nodes, the CRS itself can serve as a relay. Cognitive relaying was 
proposed in multiple access control (MAC) layer [12] where CRS directly helps to relay the traffic of the 
primary system. The benefit is that the PUs' performance can be improved due to the diversity paths via 
CBS and in return, CRS gets a higher chance to utilize the spectrum, hence increasing the overall spectral 
efficiency. This is particularly important to the primary system when the PUs' instantaneous data rate 
or outage probability cannot be satisfied, due to a weak PU link. For this reason, the primary system is 
incentivised to provide necessary CSI to the CRS for cooperation. 
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Multiple antennas at the CBS can also greatly benefit the cognitive relaying, but it is relatively un- 
explored. In [14], cognitive beamforming was devised to broadcast a common message to the CUs in the 
downlink using semidefinite programming (SDP) relaxation. Nevertheless, in the case of AF relaying, the 
results in [14] are limited because CBS only uses a vector to linearly process and forward the signal from the 
primary base station (PBS), and thus is unable to fully exploit the multiple antennas at the CBS. In [13], 
a DF relaying protocol was considered for the downlink and prior zero- forcing (PZF) , which gives priority 
to the primary system, was proposed and compared with the conventional zero-forcing (CZF) algorithm. 
A rate threshold was also given as a guideline on which method is preferred. Despite these early works, 
joint optimal relay processing and transmit beamforming is unknown even with one CBS. 

A. Our Aims and Contributions 

This paper studies the downlink scenario where several multi-antenna CBSs help to relay the PU signal 
using AF strategy while serving the CUs.^ Our aim is to minimize the total CBS transmit power (for both 
relaying and own transmission) for given PU and CU target rates by optimizing the relaying matrices and 
the coordinated transmit beamforming vectors [15]. Regarding the requirement of CSI, PBS does not need 
to learn its channels to CRS because its transmission strategy is unchanged. It is the CBSs who should 
acquire the global CSI, including CSI from the PBS, to the PU and to all CUs. The first is received CSI 
and can be estimated by training; the second can also be obtained either via the help of PBS or by listening 
to the feedback from the PU during the channel estimation of the primary system, and this is possible 
since in order to cooperate, CBSs should not be too far from the PBS. For the third channels from one 
CBS to all CUs, they can be learned by feedback from the CUs involved. In summary, perfect, either local 
or global, CSI is assumed at the CBSs. We have made the following contributions: 

• We consider cooperation among multiple CBSs using coordinated beamforming in addition to the 
cooperation between CBS and the primary system for enhancing the overall spectrum efficiency. The 
joint design of coordinated beamforming and collaborative cognitive relaying is original and novel. 

• The optimal structure of the relaying matrices is shown to match the backward channels for max- 
imizing the received signal-to-noise ratio (SNR) of the PU signal and align the noisy PU signal 

collaboratively using relay beamforming. This structure greatly simplifies the original problem. 

^As a possible application, the cooperative cognitive system will be useful to repair a satellite link which suffers from 
high path loss and rain fading. For instance, several terrestrial base stations can form a cognitive network and use the same 
spectrum to serve their own users, while at the same time help relay the satellite signal from the ground station. This improves 
the availability and saves on board satellite power, which is of great importance for extending satellite lifetime. 
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• Two iterative methods are devised to achieve the optimal solution to the total power minimization 
problem. The first one is based on a fixed-point iteration, has linear convergence, and permits a 
distributed implementation that requires only local CSI at the CBS while another method utilizes 
matrix iteration, has superlinear convergence but requires global CSI at the CBS. 

• In addition, we derive closed-form expressions for the beamforming vectors for the CZF and PZF 

schemes, which provide linear and simple solutions. A Tinique rate requirement threshold is given to 
differentiate the advantages between CZF and PZF for the case with only one CBS and one CU. 

B. Notations 

Throughout this paper, the following notations will be adopted. Vectors and matrices are represented by 
boldface lowercase and uppercase letters, respectively. || • \\x is the Ix norm and || • || denotes the Frobenius 
norm. vec(xi, . . . , xjy^) returns a column vector by stacking all the elements of the input vectors xi, . . . , xm 
in order. (•)^ denotes the Hermitian operation of a vector or matrix, while trace(A) returns the trace of 
A. The notation A G M*^'<^ indicates that A is a real matrix with dimension M x N. A >~ means that 
A is positive semi-definite. In addition, A oc B means that A = cB where c is a constant. A^^„ denotes 
the (m, n)-th element of A. Also, BlkDiag(Ai, . . . , Am) returns a block diagonal matrix with Ai, . . . , Am 
being diagonal matrices. I denotes an identity matrix of appropriate sizes. Finally, x ~ CAf{m, 0) denotes 
a vector x of complex Gaussian entries with a mean vector of m and a covariance matrix of 0. 

II. Network Model and Problem Formulation 

A. The Primciry and Cognitive System 

In this paper, we consider a primary-secondary downlink network as shown in Fig. 1. The primary system 

consists of a iC-antenna PBS communicating to a single-antenna PU, whereas the secondary system has M 

CBSs each with N transmit/receive antennas and serving a single-antenna CU.^ Without loss of generality, 

it is assumed that the m-th CU and its serving CBS are denoted by CUm and CBSm, respectively. Of our 

interest is the scenario where cooperation between CBSs and PBS is necessary to support an acceptable 

QoS of the primary system. Various possible working modes of cooperation will be discussed in Section 
^The extension to the case that one CBS serves multiple CUs is straightforward. 
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II-C. To describe our network model, we define 



1 X K channel vector between the PBS and the PU; 
1 X A'' channel vector between CBS^ and the PU; 
1 X N channel vector between CBSj and CU^; 
channel between the PBS and CBS^; 

noise received at the PU during phase I with ^ CAf{0, N[); 
noise received at the PU during phase II with ~ CM{0, N2); 
N X 1 noise vector received at CBS^ during phase I with ngm ^ CM{0, Ngm^)', 
noise received at CU^ during phase II with rim ~ CM{0, Nm)', 
transmit power of PBS; 

transmit signal for the PU with so ^ CAf{0, Pq); 
transmit signal for C[im with Sm ~ CJ\f{0, 1). 

All channels and noises are independent of each other, and the CBSs operate in an AF and half-duplex 
fashion. The communication is synchronous and takes place in two phases. In phase I, PBS broadcasts its 
message sq by x = fso where f is a fixed unit- norm transmit beamforming vector which is either or 
to be designed by the PBS. The received signals at the PU and CBS^ are, respectively, given by 

yi = hgtfso + <, (1) 

I'm = Gm^So + Usm = gm^O + ^sm, (2) 

where gm — Gmf,^rn. Then, CBS^ processes the received signal using an N x N relaying matrix to 
produce A^rm = AmgmSo + ^mi^sm- In phase II, CBS^ sends its own message, Sm, using a beamforming 
vector {wm} together with the processed signals AmFm- We assume that data symbols for the CUs, {sm}, 
and the PU, sq, are mutually independent of each other. During this period, the PBS remains idle and the 
received signal at CU^ is then given by 

M 

Zm = ^Inm (Wm^m + A^g^So + A^n^^) + ^ hj^ {wjSj + AjgjSQ + AjUgj) + n^. (3) 

J=l 



hg^ the 

hJno the 
the 

Gm the 
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The received signal-to-interference plus noise ratio (SINR) at C\Jm is then expressed as 

CIMRI ^ |flmmWm| . s 

^"^'^Icu^ V^, Ih^ w |2 + PnV^ Ih^ Ae |2 + V^ AT llh^ A-P + N ' 
Similarly, the received signal at the PU is given by 

M 

M MM 

j=l j=l j=l 

Using maximal-ratio combining (MRC) to get an estimate of sq, the received SINR of the PU becomes 

SINRIpu - Po — ^ + —TT—T ,o , ,„ t . no . • W 



Nf V*^ lh"fw|2 + V^ iV •llh'^ A -IP + ATP 
1 Z^i=i l"jo"'jl Z^j=i ^^sjll"jO-^:'ll T^^^2 



B. Problem Formulation 



Assuming perfect CSI, we aim to minimize the transmit power of CBSs subject to both PU and CUs' rate 
constraints ro and {rm}, respectively, by jointly optimizing {w^} and {A^}. Mathematically, that is, 

M M M 

min Yl ll^ill' + ^0 E Il^^-Sjll' + E^^ill^jll' 

iw,-,A,.> .^^ .^^ .^^ 

Ih^ w |2 



y-M wP + PnV"^ Ih^ A-e-|2 + V^ iV -llh^ A-IP + AT - " ^ -i,vm. 



In this paper, we mainly consider the case that the original primary link hg is weak and cooperation 
will be beneficial to the primary system. The above formulation has assumed that ro is greater than the 
rate that is achievable by the primary link alone. Therefore, it is necessary to use cooperation to improve 
the spectral efficiency for the PU. To understand this, the achievable rate for the PU depends on: 1) PU 
transmit power Pq, 2) the channel quality of PBS-CBS link gj and 3) CBS relay power. 1) is obvious while 
2) and 3) can be understood by the fact that the performance of an AF relay system is upper bounded 
by the performance of both phases. We stress that the optimal cooperation scheme between the primary 
system and CRSs is unknown and the target rate ro may well below the maximum achievable rate. 
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C. Working Modes 

Cooperation is particularly essential if the primary system is in an outage. However, when the primary 
system is in a good state, whether cooperation is needed depends on the expected return and the complexity 
it involves. In the following, we outline several possible working modes. 

Mode I: When the primary system is in an outage, PBS invites CBSs to cooperate. The necessary CSI such 
as Ihg^f I, gj and hjo can either be fed back to or estimated at CBSj with the help from the PBS and 
the PU. The CBSs will obtain the solution to (7) and make it known to the relevant terminals.^ 

Mode II: When the primary system is in a good channel state to support its rate, then 

(a) Mode II. 1 — PBS does not need help from CBSs and gives no access of spectrum to CBSs. 

(b) Mode II. 2 — PBS allows CBSs to access the spectrum under the condition that the interference 
caused by the CBSs is below some threshold Ip or "interference temperature". The PBS will 
assist the CBSs to obtain the knowledge {hjo} to ensure acceptable interference. In this case, 
the CBSs do not relay the PBS's signal but simply transmit their own messages with interference 
control to the PU. To be specific, the transmission schemes at CBSs are optimized by solving 



min 

m=l 



M 



■'■■'■mm vVtti, 



Ef^i |ht w,f + 7V„ 



> 7m, 



s.t. < ' 3' 

or similarly, e.g., to maximize the CU's rate. Note that (8) has been well studied in the multiuser 
MIMO literature [16] and the solution is therefore omitted. 

When the PU channel is in a good state. Working Mode II. 1 is preferred by the PU but it is considered 
being selfish from the CUs' viewpoint. In contrast, the CU will prefer Mode II. 2 which does not provide 
immediate benefit to the PU in the cooperation. Therefore, a practical scenario would be to regularly 

switch between the two modes to give incentives to both the PU and CU for initiating cooperation. 



^As long as the channel remains static, the optimization is valid and the information exchange between terminals for 
implementing the solution should be acceptable. Re-optimization is only required if the channel changes drastically. 
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III. Special Structure of {A^} and Problem Reformulation 

The problem (7) is a convex optimization problem because the objective function is convex and the con- 
straints can also be made convex by setting the imaginary parts of h]^QA^g^ and hmm'^m to zero [17] 
without loss of optimality. Therefore, (7) can be solved by the standard interior-point algorithm. How- 
ever, there are convincing reasons to further study the problem to i) understand the structure that the 
optimal solution possesses; ii) derive more efficient algorithms than the interior-point algorithm for convex 
problems and iii) develop distributed implementation. The rest of this paper will be devoted to above 
mentioned objectives. In this section, we provide deeper understanding of the optimal solution so that 
physical insights can be gained and more efficient algorithms can be developed. 

A. Optimal Structure of {A^} 

Theorem 1 The optimal A^ has the structure of 



where we have defined the composite channel matrix Hj = [hjo hji • • • hjM] from CBSj to all the PU and 
CUs, and is a complex parameter vector. 



B. Physical Insights and A Simplified Formulation 

Theorem 1 states that the optimal structure of A^ can be divided into two components: gm and Hmam- 
This is quite intuitive as there is only one PU message stream and the optimal A^ is of rank one. It is 
observed that each CBS first maximizes the received SNR of the PU signal using MRC, gt, during phase 
I and then relays the noisy signal using the transmit beamforming vector H^aj during phase II. 

This means that during phase II, the entire system resembles an interference multiple-input single- 
output (MISO) channel where each CBS transmits its own message as well as a noisy version of the 
common PU message using transmit and relay beamforming, respectively. For CBSj, the optimal transmit 
and relay beamforming vectors are both parameterized as Hjaj [18] where a.j is the parameter vector to 



Am = H^a^gTj, Vm, 



(9) 



Proof: See Appendix I. 



□ 
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be designed. Defining Vj = HjSij, the received PU signal (without noise) via CBSj is expressed as 



so = h]o 



-VjgliSjSo + nsj)] = llgjf hJoVjSo + hJoVj-gJn^j. 



(10) 



The overall relaying operation is illustrated in Fig. 2. Note that all CBSs should form a collaborative- 
relaying beam to ensure coherent reception at the PU, i.e., h^^Vj Vj are co-phased. When CBSs act purely 
as relays without serving their users, the structures of {A^} coincide with the results in [19]. 

Each Aj originally is a general N x N matrix but (9) indicates that it is a rank-1 matrix and can be 
represented by a relaying vector Vj with dimension N. As a result, substituting (9) into (7), we get 



M 



M 



s.t. < 



, t II ||2|2 



Ih'l" w |2 

, Ef=i |h]^w,f + Efii(i'o||g,P + iV,,)||g,P|ht^v,|2 + iV„ 



For notational convenience, we define the following vectors and matrices: 



(11a) 



(lib) 



V = vec(vi, . . . , vm), 

ho = [hio||gi||^; • • • ; hMo||gM||^], where ";" denotes vertical concatenation of vectors. 
Ho = BIkDiag (vec (^h|oA/iV^||gi||, . . . , h]^|^^/ NsmWsmW^^ , 

^ BIkDiag (vec (hl^||gi|| v/(Po||gi|P + A^.i), • • ■ ,hi^^l|gM|| V(^o||gM|P + A^.m))) , Vm, 
is a diagonal matrix with its (n,n)-th entry being (Po||gill^ + -^sjOUgjH^! 

for n = {j - 1)M + 1,... ,jM. 



(12) 



Then, (11) can be rewritten as 



mm 

{wj.v} 



M 



-I- v'''D„v s.t. < 



t,.|2 



Ih'v 



E^i|hU,|2 + ||Hovf + iVf 



> 



7o> 



Ih'l" w |2 



E7=l |hl W,|2+||H^v||2 + iV„ 



> 7m, Vm. 



(13) 



Like (7) before, (13) can be equivalently converted into a convex problem. 
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IV. Efficient Algorithms and Implementations 

In this section, we propose two algorithms to solve the dual problem of (13), which can be derived from 
its Karush-Kuhn-Tucker (KKT) conditions and is expressed as 



M 

max AoiVf + V A^iV„ 

Ao,{A^}>0 

I + Aoh^ohJ„o + X] ^n^mrMnn ^ —^rarrMnm^^ ^ ^ (14) 

Tm 



M . 

An 



s.t. < 



M 

+ ^ Xm^l^lm + AoHJHo >1 ^hohj). 

m=l ^0 



where Aq, {A^} are dual variables and their meanings will be explained later. The procedures to derive it 

can be found in the textbook [20] and previous work [17]. We will then discuss in Section IV-C how to 
recover the solution of the original problem after the dual problem is solved. 

A. Algorithm 1 Based on (14) and Fixed-Point Iteration 

The first proposed algorithm is directly based on the dual problem formulation (14). To proceed, we first 
derive an equivalent form of the constraints by rewriting the constraint in (14) as 

i ^ \ ^ \m i ^ \ ^ 

I - I + Aoh^oh|„o + XI Anh^„hJ„„ —YVmnMnm I + Aoh^oh|„o + XI Anh^„hJ„„ >z 0, 

\ n=l j Im I / 

(15) 

and further 

Am < ; — —I . (16) 



Since the objective function increases with A^, at the optimum, the equality must hold; similarly for Aq. 
Algorithm 1: Efficient fixed-point iteration to solve (14). 

Step 1) Initialize A = [Aq, Ai, . . . , \mY arbitrarily. 
Step 2) Update 

Am(A) = ^-^ — T , Vm > 1. (17) 
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Step 3) Update 

Ao(A) = — ^ — (18) 

(d^ + Em=i A^hLh^ + AoHJHoJ ho 

Step 4) Go back to Step 2) until convergence. 

Theorem 2 Algorithm 1 finds the optimal solution to (14)- 

Proof: It is easy to check that the mappings in (17) and (18) satisfy 

• Positivity: A^(A) > 0,Ao(A) > 0; 

• Monotonicity: If A > A', then > A^(A'),Ao(A) > Ao(A'); 

• Scalabihty: For all a > 1, aAm(A) > ATO(aA), aAo(A) > Ao(q:A). 

Thus, they are standard interference functions and the optimality follows directly the results in [21]. □ 

B. Algorithm 2 Based on the Virtual Uplink and Matrix Iteration 

In Phase II, as the channel from the CBSs to all CUs and PU resembles a multicell downlink [22], we 
propose an even more efficient algorithm by exploiting the duality between the original downlink and a 
virtual uplink. In this case, however, we assume that there is one CBS or an additional central unit who 
coordinates the optimization, i.e., a centralized approach. Based on (14), we construct an equivalent power 
optimization problem in a virtual uplink by introducing the auxiliary vectors {w^, v} as 



max n'^A 
A>0 



Am I hmm Wm I 



wm Ao|hj„oW„|2 + Y,n=i A„|hJ„„Wm|2 + llw^iP \^^) 
s.t. ( "^"^ 

Ao|hj)Vp ^ , 

max ^ ^ = < 7n, 

V vtD,v + E^=iAm||H„v||2 + Ao||Hov||2 - 

where n = [N2 Ni ■ ■ ■ Nm]'^ ■ Also, the vectors {w,„} and v can be interpreted as the receive beamforming 
and relay beamforming vectors in the virtual uplink, respectively. 

Since all the constraints in (19) should be satisfied with equality at the optimum, all the inequality 
signs can be reversed and the maximization can be replaced by minimization of the objective function. As 
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a consequence, (19) can be equivalently rewritten as 



mm 

\>0,{Wm},V 



s.t. < 



I hmm | 



Aol 

vtD„v + E^^i A^||H^v||2 + Ao||Hov||2 



t 112 
m 



(20) 



> 



7o- 



The "max" operations in the constraints of (19) are removed as they are automatically enforced in (20). 
The equivalence of (19) and (20) can be verified by showing that a feasible solution of one is also feasible 
for the other. Now, we have a power minimization problem (20) in the virtual uplink where Am can be 
interpreted as the uplink transmit power for CU^, and Aq is the total relaying power over all the CBSs. 
Intriguingly, instead of minimizing the total CBS transmit power in the downlink, we solve it by minimizing 
the total power of relaying and the CUs in the virtual uplink. Also, becomes the receive beamforming 
vector at CBS^ to recover the message from CU^ while v is regarded as the receive relaying vector. 

Corollary 1 The same SINR region {79, 71, ... , 7m} can be achieved in the downlink and the virtual uplink 
described by (20), using the same set of normalized transmit/receive beamforming vectors and relaying 
vectors. Mathematically, that is, in (11) and (20), we have v cx v and Wm oc w^, Vm. 

With Corollary 1, it suffices to solve (20). Before doing so, we define: 



F G 



+ l)x(Af+l) . 



: < 



Fi,i = IIHovf , 

F„,„ = 0, forn = 2,...,M + l, 
F„+i,i = |hJjoW„|^, forn = 1, . . . , M, 



Fi,„+i = ||H^vf , for n = 1, , 



Fm+i,n+i = iHim'^ml^, for m, n = 1, . . . , M, and m / n, 



(21) 



To 



Dm 

Dm,n = 0, for m / n 



|hJ)V|2' 

In 



2, forn = 2,...,M + l, 



(22) 
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^g^(M+l)xl . , 



(T„ = 1, forn = 2, ...,M + 1. 



(23) 



Then, (20) can be written as 



min n^A s.t. A > DFA + Dtr. (24) 

A>0,{Wm},V 



Notice that at the optimum, the constraint in (24) becomes an equaUty constraint. 
Algorithm 2: Efficient matrix iteration to solve (24). 

Step 1) Find a feasible solution (A, {w^}, v) to satisfy the SINR constraints in (24). 
Step 2) Update 



A = (I - DF)-^Do-. (25) 



Step 3) Update 



/ M \ 



-1 



w 



hmm, and Wm = jr^ — H"'^"^- (^6) 



Step 4) Update 



M 



V = + ^ A„hJ„H^ + AoH|)Ho ho, and v = (27) 

V m=l / ll'^ll 

Step 5) Go back to Step 2) until convergence. 

The optimality of Algorithm 2 has been established in [23] provided a feasible initial solution is given. 
To obtain a feasible solution in Step 1, we study the following SINR balancing problem: 

max C 

C,X>0,{Wm},V 



s.t. 



^=DFA + D(T, (28) 
n^A < Pt, 

where Pt is the total power constraint. (20) is feasible if and only if the optimal C satisfies C > 1. It can 
be checked that the first matrix equation constraint in (28) has a similar mathematical structure as [23, 
(34)]. Thus, the algorithm based on an eigen-system proposed in [23, Section IV-C] can be applied to solve 
(28), which results in the first step of Algorithm 2. If a feasible solution is found, the rest of Algorithm 2 
solves the power minimization problem (24) via matrix iteration following [23, Section III-C]. 
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C. Conversion to the Original Downlink Solutions 

After obtaining the optimal solution A to the dual problem (14), we can find the unit-norm beamforming 
vectors using (26) and (27). Having identified {w^} and v, we can decide on the downlink power based 
on the fact that at the optimum, all constraints in (7) are met with equalities; therefore we have M + 1 
independent linear equations sufficient to solve for the (M + 1) x 1 power vector to get 

P = W,Pi-. • • • ;Pm] = (I - DF^)-iDo-. (29) 
As a result, the downlink solution is given by 

V = a/PoV, 

(30) 

A straightforward implementation of the solution is to make the matrix (I — DF-^)~^D known to all 
the CBSs. Alternatively, however, if the beamforming directions are known, then it is actually possible to 
obtain the optimal downlink power locally in a distributed manner which avoids the matrix inversion.^ 

D. Compcirisons Between the Two Algorithms 

D.l Convergence Rate Analysis and Complexity Comparison 

Comparing Algorithm 1 and Algorithm 2, the only difference lies in the way the power is being updated. 
In Algorithm 1, the power optimizations (17) and (18) are based on the fixed point iteration while in 
Algorithm 2, the power is optimized based on the matrix equation (25). Assuming that the beamforming 
vectors are fixed, the power iteration in Algorithm 1 can be rewritten in a matrix form as 

^(n+l) ^ DFA(") + D(T, (31) 

where A^"^ denotes the solution at the n-th iteration. 

In [25], it has been analyzed that such iteration satisfies 

\\\{n+l) _ ^*|| 



"'To do so, each user feeds back information about its received SINR given a power solution, and then adjusts its transmit 
power accordingly to meet its target SINR. This distributed power control is known to achieve the optimum [24] . 
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where A* is the optimal solution and p{-) returns the spectral radius of an input matrix. The result (32) 
indicates that the power update in Algorithm 1 has linear convergence, which is true even the beamforming 
vectors are adapted at each iteration. On the other hand, in Algorithm 2, the power update directly 
optimizes the objective function to exactly satisfy all the SINR constraints with equality, i.e.. 



Hence, it is expected that starting with the same initial point, at each iteration, the power vector generated 
by Algorithm 2 is element- wisely less than that generated by Algorithm 1, which has been formally proved 
in [25] and it is further shown that the power sequence generated by Algorithm 2 satisfies 



which indicates that Algorithm 2 has a superlinear convergence rate. This comparison helps explain why 
Algorithm 2 converges faster than Algorithm 1, and will be verified by simulation results in Section VI. 

For complexity comparison, note that (17)-(18) in Algorithm 1 and (26)-(27) in Algorithm 2 involve 
similar matrix inversion operations, so their complexities are comparable. Compared with Algorithm 1, 
Algorithm 2 needs additional Step 1) to find the feasible solution via eigenvalue decomposition and Step 
2) to find the optimal A via matrix inversion (25). These steps have the overall complexity 0{M^). 

D.2 Distributed Implementation 

For implementation, the power update (33) in Algorithm 2 requires both global CSI and centralized pro- 
cessing. On the contrary, from (31) in Algorithm 1, each CU's power can be updated locally assuming all 
others are fixed, which gives rise to a distributed implementation. This is described as follows. 

The optimization of in (17) of Algorithm 1 requires only the CSI from CBS^ to all the CUs and 
the PU, i.e., = [h^o h^i ■ ■ ■ hmM] defined in Appendix I, and therefore can be implemented at CBS^ 
locally. To obtain A other than (Aq will be discussed next), it can be obtained by a small amount of 
information exchange among all the CBSs (see [15] for details) or feedback from the CUs [21]. After the 
optimal A is known, the optimization of Wm can be readily achieved using (26) with local CSI. 

The optimization of Aq in (18) of Algorithm 1 is less straightforward. At first, it may seem that the 
update of Aq involves global CSI, but in fact the matrices hJ„H^ and HqHq are both block diagonal, and 



A= (I-DF)-iDir. 



(33) 




X 



= 0, 



(34) 
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therefore the right-hand-side can be decoupled. After some manipulation, we can get 

(M \~^M 
^v+Yl ^rn&m^m + AoHJHo ho = ^ , (35) 
m=l / j=l 

where 



»j = WSjWSo ^oNsjhjoh]^ + (Po||g,|P + Nsj)I + J2 >^m{Po\\gjf + Nsj)hj^h\^ hjo. (36) 

\ m=l / 

Note that the SINR is the summation of contributions from all the collaborative CBSs. For CBSj, aj can 
be evaluated using local CSI and A. Thus, the update of Aq can be done in a distributed manner by 

Ao = ^^, (37) 

which requires that the CBSs exchange information of {aj} with each other. 

Once the optimal A is found, CBSj calculates the relay-beamforming vector by 

V,- = \\gjf |^AoiV,,^oh]o + (^'ollgif + N,j)I + >^m{Po\\gjf + Nsj)hjmhl^ hjo, (38) 

which again only relies on local CSI and can be implemented in a distributed manner. 

The structure of Vj is intuitive. If the CBSs only act as relays without serving their CUs, the optimal 
collaborative-relaying strategy is to let Vj match the forward channel {hjo}- From (38), we have 

0^ + ^^^jrnh]^\ hjo. (39) 

V m=l / 

While the CBSs also send messages to their CUs which causes interference to the PU, Vj plays a role to 
form a collaborative signaling beam to compensate the effects of interference. 

To summarize. Algorithm 1 is desirable for distributed implementation when a central controller is 
absent to gather global CSI and coordinate the required optimization. With only local CSI at each CBS 
and moderate amount of parameter exchange, the optimal transmit and relaying beamforming can be 
realized in a distributed manner. Next, we analyze the signalling requirement of the algorithms. 
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D.3 Discussion on Parameter Exchange 

The distributed strategy depends largely on the required parameter exchange between the CBSs in two 
steps. First, Algorithm 1 is used to solve the dual problem and at each iteration, the vectors [Ai, . . . , Am] 
and ex. = [ai,. . ., q;m] need to be shared among the CBSs. The total number of exchanged parameters is 
2NiM where Nj denotes the total number of iterations needed for Algorithm 1 to converge. Then, for 
the uplink-downlink conversion, the matrix (I — DF"^)^^D needs to be made available to all the CBSs, 
requiring (M + 1)^ positive scalars to be shared. The total number of parameter exchange is therefore 
2NjM + {M + 1)^, regardless of the number of transmit antennas, N. As a comparison, full CSI exchange, 
which would make possible a centralized optimization (e.g.. Algorithm 2), will require 2NM'^ scalars to 
be shared where the factor of 2 is due to the complex nature of the parameters. If the same quantization 
scheme is used, the ratio of the number of exchanged bits in Algorithm 1 over full CSI exchange is 

When the number of CBSs, M, is large, a great reduction in backhaul signaling is anticipated. 



V. Two Zero-forcing Solutions 

This section aims to develop two low-complexity closed-form zero-forcing solutions to (7). 
A. Transmit Zero-forcing Beamformer 

For coordinated beamforming of the CBSs, we write Wm = >/p^Wm, where ||wm|| = 1. For zero-forcing, 
should be chosen such that there is no interference from CBS^ to the PU and all other CUs, i.e., 

wJ„H„ = 0,Vm, (41) 

where = [h^o ^mi " " " ^mm-i ^mm+i ■ ■ ■ hj„M] has been defined in Appendix I as the composite 
forward channel from CBSj to all the PU and CUs. One possible solution for the transmit zero-forcing 
beamforming is expressed as 



w™, = 



(l-H„(Hj„H„)-iHL)h, 



,Vm. (42) 
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With the above solution, the original problem becomes 



M 



M 



mm 
{Pj 



ill, + II(^o||g,-f + A^.,-)l|g^-f l|v,-| 



s.t. < 



I vIIq--I|2|2 
AT ■lhW-|2||o-'''||2 -u A/"^ 



[ E^iC^lllgiP + A^..-)l|g.-P|h]^v,f + AT, 



2|ut 



> 7o> 



B. CZF Relaying Vectors 

With CZF, the design of v, that ensures no interference to all the CUs should satisfy 



v]Hc^ = 0, 



where Hem — [hji, . . . , hjAf] and Vj is chosen to maximize the received SINR for the PU s 



V,- = (l- Hem(HLH,m)-iHL) h,o||g,f . 



Note that (45) also ensures that all h^QV^ are co-phased for coherent reception. Defining 
to have v = ^Jpov, (43) is then reduced to the following power minimization problem: 



M 



mm 



in Pj +pov^D-^v 



r lut -12 



s.t. < 



Po||Hov||2 + iVf 

111'*' w |2 



Pn 



>7o 



Prom the constraints in (46), we can determine the minimum required transmit power as 



Po 



Pn 



7>f 



hLo^P -7ol|Hov|, 



-, Vm. 
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C. PZF Relaying Vectors 



In PZF [13], V aims to maximize the PU's channel without controhing the interference to CUs, i.e 



V 



[hio; • • • ; hjuo] 

|[hio;-- - ;h 



MOj 



This gives priority to the PU's transmission at the expense of the CUs. With (48), (43) becomes 



M 



mm 

{Pj,PO 



^ .7=1 



PolW„oV 



Po||Hov||2 + iV| 

Prn I hmm | 



> 7o> 



Similarly, we can determine the minimum required transmit power as 



7>f 



Pm = In 



Ko^r -7ol|Hov| 

Nm +Po||HmV||' 



, Vm. 



D. Compcirison for a Special Case: One CRS, One CBS and One CU 
Theorem 3 When M = 1 and r < 0.5 bps/Hz, PZF always uses less power than CZF. 
Proof: When M = 1, CZF yields 



Po 



Pi 



|hioP(l-/92)(||giP-^;iV,i)||giP' 

7iAri 7iiVi 



IhtwiP ||hnP(l-p2)' 



where p = ||h^p|||hi[|| • The total power can be found as 



pCZF 



+ 



hn||2(l - p2) ||hio||2(l - p2)(||g,||2 _ ^^AT^^) ■ 



On the other hand, the PZF power solutions are reduced to 



Po 



Pi = 71 



7>| 

hioP(||giP-7>.i)l|gIf 

Pollglf ^|^(Po||glf + N,l) +N, _ ^2^^||gt||2||h^^||2(p^||g^||2 + + 



|hiiP(l-p2) 



71- 



|hiiP(l-p2) 
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Thus, the total power for the case of PZF is found as 



pPZF ^ l|l'io||-Ul8i||--"()-^.-'iJ , ,-o^\2 U + ^ Oll&lll j 

^ ||hnP(l-/92) ^ ||hioP(||gP-7;iV,i) 

7i7oA^2V(^o||gif + Ar,i) TiATi 7;jV|'(Po||g,||2 + ^r,,) 

^ IIU 112^1 „2\ 111- II9/II_ 119 _/ AT \- V"'^/ 



|hio|P(||gi||2 - 7;iV,i)(l - p2) ||hnP(l - p2) ||hio||2(||gi||2 - 7;iV,i) ' 



As a result, we get 



pCZF pPZF 



l|hioP(||giP-7oA^.i) Vl-P^ l|hioP(||gP-7;) ^ ^ 

which indicates that PZF uses less power if and only if 71 < 1, or n = 5 log2(l + 71) < 0.5 bps/Hz. □ 



VI. Simulation Results 

Computer simulations are conducted to evaluate the performance of the proposed algorithms. We assume 
that the PBS has K = 2 transmit antennas and there are M = 3 CBSs, each with = 4 antennas, serving 
3 CUs. The channel between any antenna pair is modeled as h = d~^e^^ , where d is the distance, c is the 
path loss exponent, chosen as 3.5, and is uniformly distributed over [0, 27r). The distances from the CBSs 
to the PBS, PU and CUs are all normalized to one while the distance from the PBS to the PU is set to 2 
units. Thus, the primary channel is much weaker than other links. The PU's target rate is ro = 2 bps/Hz 
unless otherwise specified. For convenience, all the CUs have the same target rate r and noise power levels 
at all terminals are the same. We define the transmit SNR, transmit power normalized by noise power, as 
the power metric. For each result, 10^ channel realizations are simulated and averaged. Outage is said to 
occur when tq is not supported in the primary system for a channel instance and in this case the primary 
system will have cooperation with the CBS. When outage does not occur even without cooperation, the 
CBS operates in Mode II.2, i.e., it transmits to the CUs in the orthogonal space of the PU. 

We study the convergence behaviors of Algorithms 1 and 2 in Fig. 3 for a typical channel realization 
assuming r = 1 bps/Hz. For fairness, both algorithms start with a zero power vector initialization. The 
relative change in the total power is shown as the iteration goes. Results illustrate that 5 iterations are 
sufficient for Algorithm 2 to converge while Algorithm 1 converges much slower. Fig. 3 also reveals that 
Algorithm 1 and Algorithm 2 have linear and super linear convergence rates, respectively. This verifies our 
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theoretical analysis that Algorithm 2 converges significantly faster than Algorithm 1. 

The impact of PBS power on the required total CBS transmit power is studied in Fig. 4 assuming that 
each CU requires a minimum rate of 2 bps/Hz. Results show that the optimal solution greatly outperforms 
both zero-forcing solutions and saves about 3 dB in transmit SNR for the CBSs when the transmit SNR 
for the PBS increases from 4 to 12 dB. Also, PZF uses about 4 and 10 dB more power than CZF and 
the optimal solution, respectively. This is because PZF gives priority to the PU and compromises the 
performance of CUs, but for CZF, the CUs and the PU are equally important. 

Fig. 5 demonstrates the cumulative density function (CDF) of the CBS transmit SNR against the CU 
target rates, assuming that the PBS transmit SNR is 10 dB. As we can see, when r = 0.1 bps/Hz, PZF 
outperforms CZF because the PU's rate constraint dominates, thereby giving priority to the PU. When 
r = 0.5 bps/Hz, CZF and PZF have almost identical performance, whereas when r > 0.5 bps/Hz, CZF 
greatly outperforms PZF as PZF gives priority to the PU and compromises the performance of CUs. 

Next we examine the effect of the CBS power on the PU's outage performance in Fig. 6. Results for 
"No Cooperation" in which the PU uses the entire available bandwidth for transmission without letting 
the CBSs to use part of the bandwidth for cooperation is provided for comparison. We consider that the 
CUs have a target rate of 2 bps/Hz and the PBS's transmit SNR is 10 dB. Results reveal that outage 
occurs with a probability of 85% for the PU but the situation greatly improves when the PBS cooperates 
with the CBSs and if the CBS transmit power is significant. At low to medium CBS transmit SNR (0-15 
dB), cooperation does not pay off (but also does not worsen the PU outage performance) but as the CBS 
transmit power increases beyond 15 dB, the PU's outage probability drops rapidly due to the increased 
diversity from the CBSs, showing that cooperation is a promising approach to improve the PU link. 

Finally, we evaluate the robustness of the proposed algorithms against imperfect CSI knowledge between 
CBSs and the PU. To model the errors, we assume that 

hjo = hjo + Ahjo, Vj, (56) 

where hjo is the CSI known at CBS j and Ahjo ^ CJ\f{0, ^|l) denotes the CSI error. We consider a system 
with one CBS and one CU and the PBS's transmit SNR is 10 dB. Fig. 7 shows the achievable PU rates 
(with a target of 2 bps/Hz) for the three proposed schemes against It is seen that the optimal solution 
and CZF solution are quite robust while PZF is very sensitive to the error. This can be explained that in 
PZF, CBSs choose beamforming vectors aligned with {hjo}, thereby suffering most. Another important 
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observation is that the achievable rate for the PU is worst without cooperation and even at high level of 
CSI errors, cooperation gains and the CZF appears to be a promising cooperation strategy. 



VII. Conclusion 

We addressed the jointly optimization problem of collaborative relaying at CBSs to relay the primary 
signals, and coordinated transmit beamforming for cognitive transmission. The problem of minimizing 
the CBS transmit power subject to minimum rate requirements for the PU and the CUs was studied. 
The optimal structure of the relaying matrix was shown to first match the backward channels and then 
retransmit the noisy PU signal using collaborative relay beamforming. Two efficient algorithms to find the 
optimal solution were proposed and compared. A novel distributed implementation was also devised for 
the cooperation. Also, suboptimal but closed-form CZF and PZF solutions were presented. 

Appendix I. Proof of Theorem 1 

Our derivation is based on the analysis of KKT conditions. To do so, we write the Lagrangian of (7) as 

M M M 

^ = Y1 ii^.-f +^oE ii^^-siii' +E^-^-iiA^-ii' 

j=l j=l j=l 

( M M ly-M , t 



7o 



M I M M M I t |2 

+ E^- El^™-.l' + ^oEl^^A,g,i^ + 5:Ar,,||ht„A,||^^ , (57) 

m=l \ i=i j=\ j=l ™ 



where Aq, {A^} are dual variables. Setting = leads to 



M M 



m=l m=l 

Ao Ao ^ 

= 4hjoh]oAjgjg] + 4 E hiohj^oAng^g], (58) 
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where we have used ^trace(ATiAtT2) _ rp^ATi. Re-organizing (58), we obtain 
Nsj (l + Aohjohj.Q + X^hjmh]^ Aj + Po(l + Aohjohj.Q + ^ A^hj„,h].^'j A^g^g] 

V m=l / V m=l / 

Ao Ao ^ 

To To „=i 

(^Nsjl + Pogig]) « ( I + Aoh,oh]o + ^ A^hj^h]^ j h,og]. (59) 
Applying the matrix inversion lemma to (59), we have the result 

I I + Aohjohtp + Xmhjmh]^ j hjogj 

V m=l / 

(M \ ~1 / M \ 

I + X] ^rniljm'fa^]rn ] ^^jOg] OC I Qjohjo + ^ aj^hj^ 1 g]. (60) 
m=l / V m=l / 

Combining (59) and (60) gives 

Aj oc I a^ohjo + ajrni^jrn 1 g] (iV^jl + ^'ogjg]) (61) 

V m=l / V m=l / 

where we have used the matrix inversion lemma and ajo, {ajm} are all (complex) constants. 

Now, we use a composite channel matrix Hj = [hjQ hji ■ ■ ■ hjM] from CBSj to all the PU and CUs, and 
introduce a new vector = [ajo aji ■ ■ ■ aju]^ ■ Then, we reach the optimal structure in the theorem. 
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Figure 1: The cooperating cognitive system. 
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Figure 2: Illustration of the relaying processing at the CBSs. 
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Figure 3: The convergence behavior of the proposed algorithms. 
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Figure 4: The required CU power versus the PBS power. 
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Figure 5: The CDF results of the average CBS power. 
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Figure 6: The PU outage probabihty against the CBS power. 
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Figure 7: The PU rate against the channel error. 
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